Run the following command:

python parse_hackforums.py <input-dir> <output-dir> <output-dir-only-initial-post>

input-dir: directory containing the raw data as .gz files
output-dir: directory where all posts and all threads are written
output-dir-only-initial-post: directory where only the first post of each thread is written
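
Before running the parser, it can help to confirm that input-dir actually contains readable .gz files. The following is a minimal sketch and is not part of parse_hackforums.py: the helper names list_gz_files and peek_gz are hypothetical, and it assumes the compressed files hold text sitting directly inside the directory (no subdirectories).

```python
import gzip
from pathlib import Path


def list_gz_files(input_dir):
    """Return the sorted .gz files located directly inside input_dir."""
    return sorted(
        p for p in Path(input_dir).iterdir()
        if p.is_file() and p.suffix == ".gz"
    )


def peek_gz(path, max_chars=200):
    """Return the first max_chars characters of a gzip-compressed text file."""
    # Assumption: the raw data is text; errors="replace" keeps the check
    # from failing on bytes that are not valid UTF-8.
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as fh:
        return fh.read(max_chars)
```

Calling list_gz_files on the input directory and then peek_gz on one of the returned paths gives a quick look at the raw data without decompressing anything to disk.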
